Skip to content

feat: add LabelModelGrader support for OpenAI Evals backend#137

Merged
krisztianfekete merged 3 commits into
agentevals-dev:mainfrom
mesutoezdil:feat/label-model-grader
May 15, 2026
Merged

feat: add LabelModelGrader support for OpenAI Evals backend#137
krisztianfekete merged 3 commits into
agentevals-dev:mainfrom
mesutoezdil:feat/label-model-grader

Conversation

@mesutoezdil
Copy link
Copy Markdown
Contributor

@mesutoezdil mesutoezdil commented May 5, 2026

Closes #97

Adds label_model as a second grader type next to text_similarity.

label_model scores responses without a golden set. The grader config holds the model, input template, labels, and passing_labels. Items sent to the API include only actual_response.

Tests are in tests/test_openai_eval_backend.py.

@mesutoezdil mesutoezdil force-pushed the feat/label-model-grader branch 4 times, most recently from 3873cfd to 88b707f Compare May 12, 2026 18:03
@mesutoezdil
Copy link
Copy Markdown
Contributor Author

@krisztianfekete ready for review when you have time.

Copy link
Copy Markdown
Contributor

@krisztianfekete krisztianfekete left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, it's mostly good, added a couple of comments!

Comment thread src/agentevals/config.py
Comment thread examples/custom_evaluators/eval_config.yaml Outdated
Comment thread src/agentevals/openai_eval_backend.py Outdated
Adds label_model grader type, validates passing_labels against labels,
moves OpenAI grader example to a separate file.
@mesutoezdil mesutoezdil force-pushed the feat/label-model-grader branch from 88b707f to 9efd28b Compare May 13, 2026 19:44
@mesutoezdil
Copy link
Copy Markdown
Contributor Author

@krisztianfekete ready for review.

@krisztianfekete
Copy link
Copy Markdown
Contributor

Please fix the linter, and please make sure not to force-push during reviews to make the process easier.

@mesutoezdil mesutoezdil force-pushed the feat/label-model-grader branch from 06c6e86 to 50fa8cf Compare May 15, 2026 11:05
@mesutoezdil
Copy link
Copy Markdown
Contributor Author

Will not do this again during review..

@krisztianfekete krisztianfekete merged commit 8868017 into agentevals-dev:main May 15, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add LabelModelGrader OpenAI Grader

2 participants